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We present the MIM calculus, a modeling formalism with a strong biological basis, which provides 
biologically-meaningful operators for representing the interaction capabilities of molecular species. 
The operators of the calculus are inspired by the reaction symbols used in Molecular Interaction 
Maps (MIMs), a diagrammatic notation used by biologists. Models of the calculus can be easily de- 
rived from MIM diagrams, for which an unambiguous and executable interpretation is thus obtained. 
We give a formal definition of the syntax and semantics of the MIM calculus, and we study proper- 
ties of the formalism. A case study is also presented to show the use of the calculus for modeling 
biomolecular networks. 



1 Introduction 

The use of formal methods in Systems Biology provides important advantages in the description and 
analysis of biological systems, since the structure and behavior of biological systems can be described 
unambiguosly and different analysis techniques can be applied to their study. In this field, the most 
influential approach has been proposed by Regev, Shapiro and others in Il26ll27ll22ll23l . where the %- 
calculus process algebra Ifl9ll20ll is used to formalize biomolecular processes. Afterwards, many other 
formalisms originally developed by computer scientists to model systems of interacting components 
have been applied to Biology l25l[T2ll22ll28ll . and extended to allow more precise descriptions of the 
biological behaviors. Other formalisms have also been developed expressly for being used in Biology 

a in s 13 in in ma \m m m m u m . 

Biologists have introduced graphical languages for describing bioregulatory networks. As an exam- 
ple, we quote Molecular Interaction Maps (MIM) ifTTI . MIM diagrams are composed of nodes, repre- 
senting molecular species, and edges connecting nodes, which represent the possible reactions among 
species. Edges can express different kinds of reactions, according to the used reaction symbol. In this 
paper, we present a formalism which can be used for modeling and analyzing biological processes, called 
MIM Calculus (MIMc), which focusses on modeling the interaction capabilities of the involved elements. 
MIMc is defined in the style of process calculi, where each molecule appearing in the system is described 
by a term. However, unlike most of the previously proposed calculi for describing biological processes, 
which model reactions by means of process communication, MIMc provides high-level operators with a 
direct biological meaning. For example, there are operators for expressing the creation of a bond between 
two compounds (such as a complexation), and other biologically interesting events. 

The calculus has a strong relationship with Molecular Interaction Maps. The presented approach 
has a twofold advantage. On one side, we can exploit the features of process calculi such as incremental 
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definition of models, techniques for analysis and verification of properties, and easy development of sim- 
ulators. On the other side, the correspondence of the operators of the calculus with biological interactions 
allows an immediate translation of Molecular Interaction Maps into MIMc. Less immediate translations 
of Molecular Interaction Maps into more general formalisms can be found in ffl[5l|TT]]. Remark that the 
aim of the paper is to propose a calculus whose operators have a direct correspondence with the ones of 
Molecular Interaction Maps. Thus all the main MIM operators are considered, without any investigation 
about a minimal set of them able to encode all the others. 

The paper is structured as follows. After recalling the Molecular Interaction Maps, in section 2 
we introduce the MIM calculus. In section 3 we study the relationship between the MIM calculus and 
Molecular Interaction Maps, and we establish conditions under which a term of the calculus is a formal 
representation of a MIM diagram. In section 4 we show an example of modeling with the MIM calculus, 
and in section 5 we draw some conclusions. 



1.1 Molecular Interaction Maps 

MIM diagrams provide a static view of the molecular species in a system, and their possible interactions. 
Interactions are represented by lines connecting nodes representing species, and the meaning of each 
interaction depends on the symbol used to draw the line. Each molecular species can appear only once. 
Moreover, since the diagram is static, it does not contain any information about number of molecules 
(concentration) of the molecular species. 

Three classes of molecular species can be represented: elementary species (fig. EH), complex species 
(fig. HJ5) and DNA sites (fig. Hfc). Complex species represent either a combination of elementary species 
or a modified elementary species. Figure Q] shows a simple MIM diagram, containing the elementary 
species A and B which can interact. A named elementary species is drawn as a rounded box, containing 
its name. A complex molecular species, resulting from an interaction, is depicted as a bullet on the 
corresponding interaction line. For instance, in Figure [T] the complex species obtained by the binding of 
A and B is represented by the node x on the interaction line. 



B 



Figure 1 : An example of MIM diagram. 

MIM diagrams allow representing two kinds of interactions: reactions, which act on molecular 
species, and contingencies, which act on reactions or other contingencies. An interaction symbol rep- 
resents a possible interaction that can happen if certain state conditions hold. Interactions can have a 
kinetic constant k associated with them, that is used to model its "occurrence" rate. Conceptually, a 
higher kinetic constant means that the interaction is more likely to happen than an interaction with a 
lower kinetic constant. 

For defining MIM calculus we consider the reaction symbols shown in Figure [3] Note that they are 
only a subset of the reaction symbols available for use in a MIM diagram. 

• Non-covalent binding (Figure [3^): denotes the reversible binding of the two pointed species: a 
molecule of the first species can bind to a molecule of the second species, forming a compound. 
Two species joined in a non-covalent binding can eventually dissociate again. 

• Covalent modification (Figured): denotes the covalent modification of the pointed species; the 
modification type (such as phosphorylation or acetylation) is written at the tail. 
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Figure 4: Contingency symbols. 

Figure 2: Species in MIMs. Figure 3: React i on sym bols. 

• Covalent binding (Figure |3fc): denotes a covalent bond of the two connected species. 

• Cleavage of a covalent bond (Figure |3}1): denotes the possibility of a covalent bond at the head 
(right end) to be broken by the presence of the species at the tail (left end). This symbol points 
from a species to a reaction symbol representing covalent binding. 

• Stoichiometric conversion (Figure^): denotes the conversion of the species at the tail of the arrow, 
called reactant, into a corresponding number of product species, i.e. the species written at the tail 
of the arrow disappears, while the pointed ones appear. 

• Lossless production (Figure [3j): it is similar to the stoichiometric conversion, but without loss of 
the reacting species. 

• Degradation (Figure |3g): means that molecules of the species can disappear. 

The following contingency symbols, shown in Figure HI are provided by MIM diagrams: 

• Stimulation (Figure |4^): means that the molecule of the species at left end stimulates the pointed 
reaction; 

• Requirement (Figure |U)): means that the molecule of the species at left end is required in order for 
the pointed reaction to happen; 

• Inhibition (Figure the presence of the species at the tail (left end) inhibits the possibility for 
the pointed interaction to happen; 

• Catalysis (Figure @Jl): means that the pointed reaction have a much higher reaction rate if the 
species is present than if it is not. 



Interpretation of MIM diagrams There are three different interpretations for MIM diagrams |[T7l[T6l : 
explicit, combinatorial and heuristic. Each interpretation is suited to a different purpose, depending on 
the application. They differ in how interactions between indirectly connected species are considered. 
Figure |5]shows a small example of MIM, which explicitly shows the bindings between A and B, yielding 
A:B; the binding between B and C, yielding B:C; and the possible phosphorylation of B, yielding pB. 
Some questions arise, such as if a complex between A:B and C, yielding (A:B):C, can form. Or, similarly, 
if pB and A can bind. The three different interpretations address this issue, by stating which interactions 
are possible. 

A MIM diagram, in its explicit interpretation, depicts each possible reaction: an interaction line 
applies only to the molecular species directly connected to it. In this interpretation the order of bindings 
can be easily extracted from the diagram. In Figure[5l the explicit interpretation only allows the formation 
of A:B, B:C and pB. Explicit maps can be built using only a subset of MIM symbols: all contingencies 
symbols may, on the whole, be represented by a set of reaction symbols ( |[T7l ). Explicit maps without 
contingencies can be readily used for computer simulation. 
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Figure 5: An example of MIM diagram. 

Besides the complexes that are allowed by the explicit interpretation, in the combinatorial interpreta- 
tion of MIM diagrams, an interaction line represents an implicit set of complexes and, hence, of reactions. 
In particular, each interaction line represents all those reactions between the interacting species, in each 
possible combination of their binding and modification states. In Figure [51 the combinatorial interpreta- 
tion always allows A and B to bind, regardless of the fact that B is free, bound to C, or phosphorylated. 
This "transitivity" means that an interaction symbol applies indirectly to species through other interac- 
tion symbols. A main advantage of the combinatorial interpretation is this ability to synthesize with a 
few symbols a large number of possible complexes and reactions, making MIM a compact notation. 

Finally, like the combinatorial interpretation, the heuristic one allows all the complexes that are 
permitted by the explicit interpretation, with the difference that it does not specify whether each of 
the combinatorial possibilities may or may not occur, either because of lack of knowledge or because 
some contingency symbols have been omitted to avoid overcrowding the diagram. Thus, heuristic MIM 
diagrams are used to depict only what is known, leaving unspecified what still has to be discovered. 

2 MIM Calculus 

In this section we formally introduce the syntax and semantics of the MIM calculus. MIM calculus is 
defined in the style of process calculi, where an agent represents a molecule of a certain named species. 
Names A,B,C, ... are used to identify the different elementary species, and we denote by $ the set 
of names of elementary species. We also assume a set S c whose elements denote types of covalent 
modifications (such as phosphorylations). 

Definition 2.1 (Syntax). Processes P, named species S and capabilities p. of the MIM calculus are defined 
by the following grammar: 



p 


::= S P 


P 


7 »= (v,i) 


N 

— >M 


( non-covalent binding ) 


s 


::= p. IS 




I (v,i) 


N 


(covalent binding) 


IS 


::= A S:S 


qS SS 


I (v,i) 


i 


(covalent modification) 


M 


::= recx.p M 


X 


I (v,0 


N 


(cleavage) 


M 


::= M + M 


1 y 


I (v,i) 


>P 


(conversion) 



(v,l) — >P (lossless production) 



where is the empty process, A G S denotes an elementary species name, q € S c denotes the type of 
modification, x € X is a variable, and N,v,l denote species names, which are elements of the set J/ 
of terms S without capabilities. For the sake of legibility we shall often use round brackets and we shall 
systematically enclose capabilities in curly brackets. 



Barbuti, Maggiolo-Schettini, Milazzo, Pardini, Rama 



39 



Terms P of the calculus are made of a composition of molecules 5, by means of the parallel operator 
_ | _. Each molecule is of the form pt .IS, where IS describes the structure of the molecule, and pt describes 
its interaction capabilities. In particular, IS denotes either an elementary molecule of species A, or a 
compound molecule. In the case of compound molecules, IS is made of the single molecules forming 
the compound, combined by means of different syntactical operators specifying the kind of bond that 
keeps the molecules together: a non-covalent bond S\ : S2 between the species S\ and 52, a covalent 
modification qS of species 5, or a covalent bond S1S2 between S\ and 52- Note that the capabilities of 
each molecule forming a compound are retained in the compound description. 

For example, term {7} .A models a molecule of species A, having a single interaction capability 7. 
A complex formed of two simple molecules A and fl can instead be represented as /^.(/^.A : jU3.fi), 
where pt\ are the capabilities of the compound, and pt 2 , /I3 are the capabilities of molecules A and fi, 
respectively. 

We denote the set of species by 5?, and identify jV C 5? as its subset of named species without 
capabilities, i.e. where each \i is empty (pt = 0). We assume a function [_-J : — > JV that strips all the 
capabilities from a named species 5 € . For example, \}i\.(\iiA : jU3.fi) J = 0.(0.A : 0.B). Moreover, 
we often avoid writing empty capabilities when no ambiguities arise, therefore we simply write A : fi 
instead of 0.(0.A : 0.fi). This function is extended to processes |_-J : 8? — > P(^) as \S\ \ ■■ ■ | 5„J = 

L5 1 ju---uL5„J. 

The calculus allows expressing different capabilities for molecules. Operator — >;U means that a 
species can form a non-covalent bond with a molecule of species name N. The result will be a compound 

N 

of the form _ : _ made of the two involved species, and with capabilities pi. Similarly, operator =jU 
means that a species can form a covalent bond with a N, resulting in a compound of the form 5i 52 

with capabilities p.. The operator for covalent modification =5-pL, similarly produces a compound qS 

N 

with capabilities p. The operator for cleavage _r* means that a molecule can break the covalent bond 
specified by N, where N has to be either of the form of Ni N 2 or qN. Finally, there are the operators — *-P, 
for expressing a conversion of a molecule into other molecules, and — >P for a lossless production of 
molecules. In both cases, the resulting molecules are represented by a process P. 

We allow recursive definitions of capabilities, by means of the recursion operator rec. As always, 
rec x.p binds the free occurrences of the variable name x in p. We assume a substitution function p [n'/x] 
for replacing each free occurrence of x in /i with fx'. The substitution function is also extended to pro- 
cesses. We use the notation rec x./j. with x = xi,...,x n £ Var* as an abbreviation for rec x\ . • ■ ■ .rec x n .pt. 

Names v,l are used to express contingencies on the application of an operator, depending on the 
species appearing in the environment. The former, v, expresses the species that must be present (promot- 
ers), while the latter, 1, expresses those that must be absent (inhibitors). We omit writing contigencies 
when they are empty. 

To give an example of a term in which recursive capabilities are used, let us consider a system in 
which substrate A is transformed into product C by the enzyme E. The MIM diagram in Figure [6] shows 
that enzyme E binds to A and the complex E : A is subsequently transformed into C and E, thus recreating 
the enzyme. The enzyme E can be modeled in MIMc by the following term: rec x.{-^{ — >(x.E \ 
0.C)}}.E. 

Definition 2.2 (Structural congruence). The congruence relations = x on the syntactical categories x £ 
{P,S,IS,p.,M, 7} of the calculus are the least equivalence relations closed under syntactical operators 
and such that the following laws hold: 

I. P l \P 2 = P P 2 \P l , P l j (P 2 j P 3 ) = P (Pi I P 2 ) I P 3 , P|0= P P; 
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Figure 6: An example of recursive MIM 



2. Si : 52 =is S 2 : Si, SiS2=/sS2Si; 

3. M 1 +M2=mM 2 +Mi,M 1 + (M2+M 3 ) = m (M 1 +M 2 )+M 3 ,M + 0= m M,M + M= m M; 

4. (a-conversion) jii =^ ji 2 if they differ only on bound names; 

5. rec x./J, =fi /J,[rec x./x/x]. 

We omit the indication of x in = x when no ambiguities arise. 

We propose now a reduction semantics for the MIM calculus, given in terms of a Labelled Transition 
System (LTS) representing the possible evolutions of a term. The labels of the LTS are actions identifying 
the reactions that each single transition models. All the possible actions Act are of the following forms: 

(a) Ni~N 2 , (b)JVi^ 2 , (c) N—*{N U . . . ,N k ), (d) N—>{N U . . . ,N k }, (e) N l =N 2 , (f) N^NM, (g) 
q =>N, (h) N^+qN\ , which respectively represent (a) the creation of a non-covalent bond, (b) the cleavage 
of a non-covalent bond, (c) a conversion, (d) a lossless production, (e) the creation of a covalent bond, (f) 
the cleavage of a covalent bond, (g) a covalent modification, (h) the removal of a covalent modification. 

(X 

Definition 2.3 (Reduction semantics). The reduction semantics of MIM calculus is the relation — > on 
processes such that 

P^P> iff 3ie,yV.P^^P' (1) 

where a G Act is an action that represents the capability of P used for the reduction step, and ^'^"i , 
with v,l € <yf, is the least relation on processes, closed under structural congruence =p, and satifying 
the following inference rules: 



^ = {X + (v,i)^ii} a=[Si]~[S 2 ] LS!j,LS 2 J0i 

_, 1 _ (v,i) a , , 

/ii.5i I M2-S 2 — ►MMi-Si :jU2-S 2 ) 

a = LSi] </> LS 2 J 

jU.(Si :S 2 ) >Si I S 2 

M = {X + (v, t) — P} a = [S\ —> [P\ [S\ 1 

„ (v,i) a „ 

jX.S y ' P 

H = {X+ (v, 1) — >/>} a = [S\ — > [P] LSJ g" 1 

Ai .s^ Ai .<>| J p 

Mi = {x + (v,t)ii^ i u} «=LSiJ = ls 2 J [Si],[s 2 ]gi 

(v,i) a 



(2) 
(3) 

(4) 

(5) 

(6) 



M1.S1 |/i 2 .S 2 ^-^ju.(/ii.Si)(/i 2 .S 2 ) 
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\SiS 2 \ 

jU = {X + (v, i) } a = [S\ ^ \SiS 2 \ [S\ , \SiS 2 \ <£ i 



ju.5 | n'.S x S 2 11. S | 5i | S 2 

Hi = {X + (v,i)^h} a = q^[S l \ L^iJ^i 



(v,i) a —7 FT 



[qS 



H = {X + (v,i) _r* } a = LSJ -*» gSiJ [S],[qSi\^i 



(v,i) a 

H.S\n'.qSi ) H-S I Si 

plXA^f [gjni=0 v ' = v\[ej 

(v',i) a 



(7) 
(8) 

(9) 
(10) 



Rule|2]deals with the creation of a non-covalent bond between molecules Hi .Si and }X 2 .S 2 , thus giving 
rise to a complex H-(Hi-Si : pL 2 -S 2 ). Note that the rule requires that molecule Hi .Si has the capability 
of binding with a molecule with name LS2J (the symmetric capability is not required for pi 2 .S 2 ). Rule 
[3] deals with the cleavage of a non-covalent bond. There are no conditions for the cleavage. Rule [4] 
deals with the conversion of a molecule H-S into a number of other molecules when /J..S has the proper 
capability. Rule [5] deals with the lossless production, namely with the case in which fi.S produces a 
number of molecules without disappearing. Rules [6] and [TJ are the analogs of rules |2] and [3] for the case 
of covalent binding. The unbinding, expressed by rule|7J requires the presence of a molecule /J..S having 
the capability of breaking the bond. Rules [8] and [9] deal with molecule covalent modification of a type q 
and with the removal of the modification, respectively. 

All the rules are applicable only if the molecules involved are not inhibitors of the transformation 
itself. Rule [10] is used to apply a step of the reduction to the parallel composition of processes. The 
conditions of the rule ensure that the step is not forbidden by any of the molecules present in the com- 
position. Moreover, these conditions, together with the fact that we need that the set v of promoters is 
empty to actually do the reduction step (defined inQ}, ensure that all the promoters of the capability used 
for that step are present in the parallel composition of processes. As usual, we define — >* as the reflexive 

(X 

and transitive closure of relation — ». 

We now define contexts, which represent terms with a hole, denoted as □. The hole corresponds to 
the collection of capabilities of a molecule, and therefore the hole occurs in the position of a capability 
H- Conceptually, contexts allow identifying the position inside a term in which a molecule of a certain 
species appears. 

Definition 2.4 (Context). Contexts of MIM calculus are defined by the following grammar: 



c 


::= P\S C 




Yc "= (v,t 


N 

) — >H C 


(non-covalent binding) 




::= n,.JS 


| ll.IS c 


(v,i) 


N 

=Hc 


(covalent binding) 


IS C 


::= S:S C . 


qS c SS C 


(v,i) 


1 

^Hc 


( covalent modification ) 


He 


::= □ 


M + y c 


(v,0 
(v,i) 


— >c 


(conversion) 
(lossless production) 



The syntax ensures that exactly one hole is present in a context. Given S c , the hole can occur either 
in the capabilities of S c itself, when S c = He -IS, or in the capabilities of one of the molecules of which 
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the molecule is composed, S c = >i.IS c . Given a capability with a hole ti c , the hole can be either the 
capability itself □, or it can occur in one of the capabilities appearing in ii c . In particular, if ii c = y c (a 
basic capability) the hole can occur inside the capabilities of the species y c allows producing. Given a 
context C, its hole can be substituted with a capability jU, giving a process denoted C[ii\. 

For example, the context C{ = /^.(/^A : d.B) represents a molecule complex A : B in which the 
hole refers to the capability of B forming the complex. Context C\ can be applied to a capability /X3 
obtaining C\ [/X3] = /^.(/^.A : pL^.B). Note that in this case, the hole is relative to a species named B, and 
this is clearly visible from the syntax of the context. However, in other cases, the name of the species 
relative to a hole is not directly present in the syntax of the context. For example, the hole in context 

C2 = { — H >P}.A is relative to the species obtained as a complexation between A and B, whose 

name is A : B, which is not directly present in the syntax of the context. In order to extract, from a given 
context, the name of the species relative to the hole, we use a function name : ^£ — ► JV defined as follows. 

Definition 2.5. Function name, from contexts to molecular names Jf , and name' (^l c ,N) = N, from a 
context ii c and a name jV to a name , are recursively defined as follows: 









name N) 


= N 


(17) 


name(P S c ) 


= name(S c ) 


(11) 


name ' ((v,t) — ►ju C) -W) 


= name'(il c ,N : N') 


(18) 


name ((v ,l)^fl c ,N) 




name(il.IS c ) 


= name(IS c ) 


(12) 


= name'(fl c ,NN') 


(19) 


name(S : S c ) 


= name(S c ) 


(13) 


name' ( ( V , I ) =^>jU c , N) 


= name' \il c ,qN) 


(20) 


name(qS c ) 


= name(S c ) 


(14) 


name'((v,l) — ►C,iV) 


= name(C) 


(21) 


name(SS c ) 


= name(S c ) 


(15) 


name ( ( V , I ) — >C,N) 


= name(C) 


(22) 


name(fX c lS) 


= name'(iJ. c , [IS\ ) 


(16) 


name'(M + y c ,N) 


= name(y c ,N) 


(23) 



Definition of function name is given by two mutually recursive functions name and name'. In par- 
ticular, name' takes two parameters, a capability context ii c and a name N, where Af is the name of the 
species with which this capability is associated. The function name' is used in equation [161 where ex- 
tracting the name, relative to the hole, from a context jJ. c .IS is reduced to extracting the name from ;U C , 
knowing that the capability \i c is relative to a species named [IS\ . 



Example Consider an example of a MIM process which represents a molecular system described by 
the MIM diagram shown in Figure |7J 



A 


A:B 


R ] 


< — -m — ► 



>(A:B):C 



Figure 7: A MIM diagram. 



A MIM process, differently from a MIM diagram, represents both the possible interactions among 
the species and the number of molecules that are present in the system. The following MIM process 
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corresponds to a system with species A, B and C, with the interaction capabilities described by the 
diagram in Figure|71 and in which there are two molecules of A, two of B and one of C: 



0}.B | 
0}.B | 
{^0}.C 

In the process P, the species A can complex with B, producing a molecule able to complex with C. 
Species B is able to perform the same reaction of A, and it can also be phosphorilated. Finally, species 
C can complex with A : B. Recall that the non-covalent bond of A : B and C : (A : B) can dissociate 
autonomously as described by rule[3]of the reduction semantics. 

3 Consistency 

In this section we investigate the relationship between Molecular Interaction Maps and MIM Calculus. 
We propose some consistency definitions, with the aim of identifying the terms of the calculus which 
could be formal representation of a MIM diagram. The first difference between MIM diagrams and the 
MIM calculus is that diagrams provide a static view of the species of a system, and of the interactions 
which can occur among the species, while MIM calculus allows representing single molecules and pro- 
vides a semantics for deriving the evolution of the described system. MIM diagrams are also restricted 
with regards to the capabilities of species, since each single molecular species can appear only once in a 
diagram. For this reason, the informal interpretation of MIM diagrams assumes that the capabilities of 
each molecule depend only on the species of the molecule. Capabilites are irrespective, for example, of 
the different reactions that might produce a molecule of that species. On the contrary, the MIM calcu- 
lus allows representing single molecules pt .IS, and different molecules of the same species might have 
different capabilities. For example, term P = { — >/A\}.A \ 0.A \ P' contains two molecules of species A 
with different capabilities: the first one can bind to another molecule of species B, while the second one 
has no capabilities. Note that these two molecules of species A could have been obtained as a result of 
other reactions (for example, by transformation of other molecules), hence during any evolution of the 
system there may be some states in which all the molecules of the same species have the same capabili- 
ties while, in other states, this is not true. It appears to be of particular interest to establish which terms 
of MIMc represent MIM diagrams, in the sense that a MIM diagram can be associated with a term, and 
the term evolves in accordance with the behavior intended by the diagram. One may also ask that in a 
term molecules of a certain species always have the same capabilities. This captures the constraint of 
uniqueness of species in MIM diagrams. 

For this purpose, we present three different definitions of consistency of MIM calculus terms, namely 
semantic consistency, (weak) syntactic consistency, and strong syntactic consistency. Semantic consis- 
tency is the weakest form of consistency, and takes into account only terms that can be reached from the 
initial state. This form of consistency requires that, whenever a molecule of a certain species named N is 
produced (i.e. a molecule S, with [S\ = N, appears in the top-level parallel composition), it always has 
the same capabilities. 



P={^{-^0}}.A 
{^{^0} + - 
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Definition 3.1 (Semantic Consistency). A term P is semantically consistent iff 

VfliSuHz-Sz^yP". [Si\ = [S 2 \ and P^* ii l .S l \ P' and P ->* \i 2 .S 2 | P" implies \i\ = ii 2 

The following definitions of syntactic consistencies deal instead with the species that syntactically 
appear in a term. The weak form requires that the capabilities of each molecule of a species appearing 
in the term, including those forming compound molecules and those that can be obtained as the result of 
reactions, always have the same capabilities. The definitions make use of contexts, in order to precisely 
identify the position inside the term in which molecules of a species (with their capabilities) appear. 

Definition 3.2 ((Weak) Syntactic Consistency). A term P is (weakly) syntactic consistent iff 

MC\,C 2 ,il\,il 2 . name(C\) = name(C 2 ) and P = C\ [jUi] = C2[jU2] implies ii\ = jx 2 

Strong syntactic consistency adds a further constraint, by requiring also that, whenever a non- 
covalent bond ( — >) or a covalent bond (=) can be created between two species, then both species 
have the corresponding capability. In the definition, we write y G ii as a shorthand for 3M. il = {M + y}. 

Definition 3.3 (Strong Syntactic Consistency). A term P is strongly syntactic consistent iff 

• P is weakly syntactic consistent and 

• VCi,C 2 , J Ui,At2,A r i,A^2- 

name(C\) = N\ and name(C 2 ) = N 2 and P = C\ [jUi] = C2[jU2] implies 



For example, term P 2 = { — >Hi}-A | 0.B is weakly syntactic consistent, but not strongly syntactic 
consistent, since molecule B does not have the capability of binding (with a non-covalent bond) to A. 

B A 

The strongly syntactic consistent term corresponding to P 2 is P 3 = { — >Hi}-A \ { — >ili}-B. 
The following proposition shows that syntactic consistency implies semantic consistency. 

Proposition 3.1. Syntactic Consistency entails Semantic Consistency, that is 



Proof. It is sufficient to prove that P — >* /J..S | P' implies 3C. C[fl] = P and name(C) = [S\ . This proof 
is done by induction on the length of the sequence of transitions P — >* jj..S \ P' . Let us assume that such 
a sequence has the following form P = Pq — ^* Pi • • • — ^ P n = ii.S\P' . 

As regards the base case (n = 0), we have P = jj..S \ P'. Then, context C = O.S \ P' is such that C[ju] = P 
and name(C) = [S\ . 

As regards the induction step, let n > 1 and suppose that the property holds for all m < n. We have 
P, = /I..S | P' and there are two cases to be considered: either /J..S already appeared before, i.e. 3n < 
n. Pn = fi.S \ P" for some P" , or not. In the first case, by induction hypothesis, there exists a context C 
such that C[jli] = P and name(C) = [S\ . In the second case, il .S has been created in the last execution step 
Pj-i — ^ A 4 $ I P'> where /J..S does not occur in the top-level parallel composition, i.e. $Q. P n -\ = {J..S \ Q. 
According to the semantics, P„_i H.S | P' iff P n -i h /x.S | P' for some i G By rule induction 

on the rules l2l-fT0l of the semantics, we prove that, for all transitions Q ^ v ' l ^ a " ) Q' t f or anv ^ s created in 
the transition there is a context C such that C[ju] = P and name{C) = [S\ . 




VP. P w syntactically consistent ^> P « semantically consistent. 
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• Rule\2\ Hi. Si | /x 2 .5 2 "> jU.(jUi.5i : ix 2 .5 2 ) with jUi = {X + (v,l)-^ju} 

By induction hypothesis, there exists a context C such that C\jXi] = P and name(C) = \S\\. There- 

fore, context C = C[{X + (v, l ) J -AD}] is such that C[/x] = Pand name(C) = [{jli-Si : ^2-^2)] • 

• /?wZe|3 fi.(ni.Si : /x 2 .5 2 ) (0 ' ' > "> jUi .5i | H2.S2 

Let C be the context of xx.(xXi.5i : H2S2) (by induction hypothesis). Context C must contain the 
portion □ .(/Xi.5i : H2.S2). Suppose that at least one of pLy.Si and H2S2 never appeared before in 
any P m , m<n (otherwise, by induction hypothesis, their contexts are already known). Hence, term 
S = jU.Qui.Si : H2S2) has not been obtained by applying rule [2 but by one of the rules 13 14151 17 191 
This means that 5 appeared literally in the inital term P, thus contexts C\ can be obtained from C 
by replacing □.(/Xi.5i : XX2.52) with /x.(D.5i : H2S2) and contexts C2 can be obtained from C by 
replacing □.(/Xi.5i : H2S2) with /x.(iXi.5i : U.S2). Contexts Ci and C2 are such that Ci[/ii] = P 
with name(Ci) = \Si\ and C2 [1X2] = P with name (C2) = |_52_|- 

• Rule® H1.S1 n.S \ P' with Hi = {X + (v,l)—-(n.S \ P')} 

Let C be the context of fr-Si. Then the context for ix.5 is C = C[{X + (v,t) — ►(□.5 | P')}]- 

• Rule\5\ analogous to rule|4] 

• Rule® analogous to rule [2] 

• Rule\7\ H1.S1 I ^'.(/i2-5 2 )(Ai3-53) — "> I ^2-^2 | H3S3 

Similarly to rule[3l if either or H3S3 did not appear before, their contexts can be obtained 
from context C of /x' .(/X2. 52) (/X3 .53). Context C must contain □ . (/X2 .52) (/X3 .53). We obtain context 
C by replacing □.(/X2.5 2 )(/X3.53) with /x'.(n.52)(/X3.5 3 )) in C for /X2.5 2 . Similarly, we obtain 
context C by replacing □.(/X2.5 2 )(/X3.5 3 ) with /x'.^^XD^)) in C for /X3.53. 

• P«Ze[!J /Xi.5i (va) "> /x.^(/Xi-5i) with /x t = {X + (v, i)=^/x} 

Let C be the context of \X1.S1. Then the context for \i.q{\x x .Si) is C = C[{X + (v,l)=^D}]. 

• Rule® analogous to rules [3] and [7] 

• RuleM P\Q^^P' \Q 

Since Q is not modified by the transition, only terms jx.S in P' could have been created by the 
transition. The contexts of any /x.5 in P' is given by the induction hypothesis on the rule. □ 

4 An Example of Modeling 

In this section we show an example of a real molecular interaction map, taken from [15'], and we show 
the corresponding term in MIM calculus. Differently from Kohn maps, the MIM calculus can contain 
multiple molecules of a same molecular species, thus it can describe the evolution of the system starting 
from an initial configuration. 

The example in [ 15] presents a comprehensive molecular interaction map of regulators of cell cycle 
and DNA repair processes. The presented map is limited to the events in the mammalian cell nucleus. 
We consider here only the interaction between a protein of the E2F family and a gene promoter E2. This 
interaction is an important part of the cell cycle. The transcription of the gene is activated or inhibited by 
the binding of different complexes with the promoter. The molecular interaction map representing the 
interactions among stimulatory and inhibitory complexes of E2F1, DPI and pRb is shown in Figure [8] 
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Figure 8: Molecular interaction map representing interactions among E2F1, DPI, and pRb. 



The E2F1:DP1 dimer (indicated by node (a) in Figured]) and the (E2Fl:DPl):pRb trimer (node (b)) 
can be bound to the promoter element E2. When the E2F1:DP1 dimer is bound to E2 the transcription 
activity is stimulated, while when the (E2Fl:DPl):pRb trimer is bound to E2 the transcription is inhib- 
ited. We represent each species involved by a MIMc term with the capabilities of the species itself. In 
particular we use particular terms for representing the promoter E2 and the DNA. In the Kohn map the 
DNA is implicitly represented, but in a MIMc term DNA must be represented explicitly, for assigning 
to it the capability of producing the mRNA. Remark that we can have multiple copies of the species 
E2F1, DPI, and pRb in a MIMc term representing the system. Coherently with the cell system we can 
have only one copy of the DNA and of the gene promoter E2. Each basic element is identified by an 
elementary species name E2FI, DP 1 , pRb ,E2, DNA , mRNA G S. 

The E2F1 species can be represented by the following term: 

r DPI ( E2 pRb , E2 ... 

{ >{ — >0 H >{ — >0 }} }.E2F\ = \i\.E2F\ 

which states that E2F\ can be bound to DPI, and then the dimer can be bound to E2. Note that the 
stimulation of DNA transcription by the trimer is not modeled among the capability of the species, which 
are just empty. Instead, this behavior is captured by the DNA process, shown in the following. Moreover, 
the E2FI : DPI dimer can be bound to pRb and then to promoter E2 to inhibit the transcription. As for 
stimulation, inhibition is captured inside the definiton of the DNA. Species DP 1 and pRb are represented 
by the terms: 

{^{^0 + ^{^0}}}.DPl =At2 .DPl 
{^^{^0}}.pRb = n 3 . P Rb 
Finally, the promoter E2 and the DNA can be represented by the terms: 

, E2FV.DPI (E2Fl:DPl):pRb 

{ >0 H >0\.E2 = ii 4 .E2 



{(vdna,Idna) — >mRNA}.DNA with 



V DNA = {(E2F\:DP\):E2} 

Idna = {{{E2F\ : DPI) : pRb) : E2} 



The lossless production of mRNA by the DNA is regulated by the presence/absence of the two complexes 
{E2F\ : DPI) : E2 and {{E2F\ : DPI) : pRb) : E2. In particular, the former complex represents a 
promoter (triggering the reaction), while the latter represents an inhibitor for the reaction. 
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An initial configuration in which two molecules of species E2F1, DPI and pRb are present is repre- 
sented by the following MIMc term 

Pi = ]X X .E2F\ | \x x .E2F\ | \X 2 .DP\ | \i 2 .DP\ \ ]X 3 .pRb \ }l 3 .pRb \ ]X A .E2 
I { ( Vdna , Idna ) —>mRNA } .DNA 

The term can evolve towards different configurations. For example, after a complexation between E2F 1 
and DPI occurs, the processes \i\.E2F\ and \i 2 .DP\ are replaced by the following term, representing a 
complex with name E2FI : DPI: 

{^0 + -^{-^0}}.(H 1 .E2Fl:n 2 .DPl) 

Thus the whole term becomes: 

P 2 = VI.E2FI I ji 2 .DP\ I ji 3 .pRb I n 3 .pRb \ \l A .E2 \ {(v DNA ,l DNA ) — >mRNA}.DNA 
I {-^0 + -^{-^0}}.(ni-E2Fl : Hi-DPl) 

As a further evolution step we may have the binding of the dimer E2FI : DPI to the promoter E2. The 
resulting term is: 

P 3 = Hi.E2Fl I H2-DPI I ]X 3 .pRb \ ]X 3 .pRb \ {(v DNA ,l DNA ) — >mRNA}.DNA 
I 0. f({-^0 + ^{^0}}.( jUl .£2Fl : H 2 .DP1)) : ^.E2 



As an example of derivation, we show how the semantics is applied to the term Pi above obtaining 
the term P 2 in a single reduction step. For the sake of readability, we write the terms Pi, P2 as: 

P X =\L V .E2F\ \\l 2 .DP\ \R 
P 2 = }X.{ni.E2Fl :n 2 -DP\) \ R 

where 

R = Hi.E2Fl I ]X 2 .DP\ I \x 3 .pRb \ ]X 3 .pRb \ }i A .E2 \ {(v DNA ,l DNA ) — >mRNA}.DNA 

r El „ pRb r El ~ , 

H = {-^0 + ^{-^0}}. 

The transition Pi V '' > P 2 , with V = l = 0, is obtained with the following derivation, by using the rules 
of the semantics: 

jui = {X + (v,i)-^ju} y = i = E2F\,DPl£i 

^.E2Fl\^DPl gj E2Fl - DPl ^ .(^.E2Fl^ 2 .DPl) L*]ni = v' = v\[*]=0 

(V l) ElFl^DPl 

Hi.E2Fl \ix 2 .DP\ \ R— >n.(Hi.E2F\ : pL 2 .DP\) \ R 

Finally, according to property (Q}, we have the transition Pi £2F1< ~ >Z?P1 > p 2 
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5 Conclusions 

Using formal methods for studying biological systems is an interesting approach that allows using many 
different analiysis techniques. We have defined the MIM calculus, a new calculus with high-level op- 
erators directly inspired by Molecular Interaction Maps (MIM), a graphical notation used in biology. 
This approach allows exploiting the features of process calculi, such as incremental definition of models, 
techniques for analysis and verification of properties, and the development of simulators. Moreover, the 
correspondence of the operators of the calculus with biological reactions allows an easy translation of 
Molecular Interaction Maps into terms of the MIM calculus. We have studied conditions under which a 
term of the MIM calculus is a formal representation of a MIM diagram, and we have provided different 
consistency definitions for the terms of the MIM calculus. 

In the future, we plan to investigate the properties of calculus, such as its expressiveness, and to 
develop different extensions for a better description of biological systems. 
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